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Abstract 

We propose adaptation strategies to modify the standard constrained model pre- 
dictive controller scheme in order to guarantee a certain lower bound on the degree 
of suboptimality. Within this analysis, the length of the optimization horizon is 
the parameter we wish to adapt. We develop and prove several shortening and pro- 
longation strategies which also allow for an effective implementation. Moreover, 
extensions of stability results and suboptimality estimates to model predictive con- 
trollers with varying optimization horizon are shown. 

Keywords: nonlinear model predictive control, suboptimality, stability, 
adaptation strategies 



1. Introduction 

Nowadays, nonlinear model predictive controllers (NMPC), sometimes also 
called receding horizon controllers (RHC) are used in a variety of industrial ap- 
plications, cf. 0]. As shown in lfT2l[P7Tl . theory for such controllers is also widely 
understood both for linear and nonlinear systems. The control method itself deals 
with the problem of approximately solving an infinite horizon optimal control 
problem which is computationally intractable in general. Reasons for its success 
are on the one hand its capability to directly incorporate constraints depending 
on the states and inputs of the underlying process. On the other hand, the funda- 
mental steps of this method are very simple: First, a solution of a finite horizon 
optimal control problem is computed for a given initial value. In a second step, the 
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first part of the resulting control is implemented at the plant and in the third and 
last step, the finite horizon is shifted forward in time which renders this method 
to be iteratively applicable. As a consequence, the control which is applied at the 
plant is a static state feedback. 

Due to considering only finite horizons, the inherent stability property of the 
infinite horizon problem does in general not carry over to the NMPC problem. 
To cope with the stability issue, several solutions have been proposed in the past, 
i.e. by imposing endpoints constraints IfTOll or adding so called Lyapunov function 
type endpoint weights and a terminal region to the NMPC problem [2] . A third 
idea deals with the plain NMPC problem without the requirement of added con- 
straints or a modified cost function. To show stability of the resulting closed loop, 
in E 13 a relaxed Lyapunov inequality is assumed. 

In either case, the horizon needs to be chosen as a worst case scenario which is 
usually needed to cope with small regions of state space only. Our aim in this work 
is to develop online applicable adaptation strategies for the horizon length which 
guarantee stability of the closed loop. Here, we follow the third approach since the 
original intention of the infinite horizon cost stays untouched, and make use of the 
suboptimality estimates given in 0. Based on the structure of these suboptimality 
estimates and on the structure of the NMPC problem itself, we propose several 
techniques to fit the horizon to the control task, the current state of the system 
and also to the internal information of the NMPC controller itself. Due to the 
change of the structure of the controller, however, known stability proofs and 
suboptimality results flfSEllHlEll cannot be applied. To cover these issues, stability 
results with varying optimization horizons are presented. 

To some extend adaptation strategies of the horizon are known in the literature, 
see, e.g., fl5l IT9J . In contrast to these pure heuristics, our approach can be proven 
rigorously and doesnot require any insight into the process under consideration. 
Moreover, a change of the cost functional is possible without modification of the 
adaptation law which allows for testing various settings of the controller. Last, the 
quality of the resulting closed loop is tunable by a single variable characterizing 
the allowable tradeoff compared to the infinite horizon optimal control law. 

The paper is organized as follows: In Section [2] we describe the problem setup 
and state the a posteriori and a priori suboptimalty estimates which will be the 
foundation of our analysis. In the following Section |3j we first show how known 
stability results and estimates can be extended to the case of varying optimization 
horizons. In Section[4j we develop various shortening and prolongation strategies 
based on the suboptimality estimates from [7]. To show applicability and effec- 
tivity of the proposed methods we present numerical results in Section [5} Finally, 
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Section [6] concludes the paper and points out directions of future research. 

2. Setup and Preliminaries 

In this work we consider nonlinear discrete time systems of the form 

x(n + 1) = f(x(ri), u(n)), x(0) = xo (1) 

with x(n) e X and u(n) e U for n e No where No denotes the natural numbers 
including zero. In this context, the state space X and the control value space 
U are arbitrary metric spaces. Therefore, all presented results also apply to the 
discrete time dynamics induced by a sampled infinite dimensional system. State 
and control constraints can be incorporated by replacing X and U by appropriate 
subsets X c X and U c U. Here, we denote the space of control sequences 
u : N — » U by U N ° and the solution trajectory for given control u e U N ° and 
initial value xq e X by x u (-, x ). 

The task which we pursue is to find a static state feedback u = fi(x) e U N ° for 
a given control system ([T]) which minimizes the infinite horizon cost functional 
Jcoixo, u) = 2JT=o K*u{n, xq), u(n)) with stage cost I : X x U — » Eq where Mq 
denotes the nonnegative real numbers. The optimal value function for this problem 
is denoted by Ko^o) = inf t(eU N Jcoixo, u). Moreover, one can prove optimality of 
the infinite horizon feedback law /i(-) given by 

H(x(nj) = argmin {Voo(x M (l, x(n))) + l(x(n), u)} (2) 

using Bellman's optimality principle for a given optimal value function. Here we 
use the argmin operator in the following sense: for a map a : U — » E, a nonempty 
subset U c U and a value u* e U we write 

u* = argmin a(u) (3) 

ueV 

if and only if a(u*) = inf ue ^a(u) holds. Whenever ([3]) holds the existence of 
the minimum mm ue ^ a(u) follows. However, we do not require uniqueness of 
the minimizer u*. In case of uniqueness equation @ can be understood as an 
assignment, otherwise it is just a convenient way of writing "w* minimizes a(w)". 
Here we assume that the minimum with respect to u e U is attained. 

Since the computation of the desired control law requires the solution of a 
Hamilton-Jacobi-Bellman equation, we use a model predictive control approach 
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in order to avoid the problem of solving an infinite horizon optimal control prob- 
lem. The NMPC methodology is simple and consists in three steps which are 
repeated at every discrete time instant during the process run: Upon start of each 
iterate, an optimal control for the problem on a finite horizon is computed. Then, 
the first element of the control is implemented at the process and in the third step 
the entire optimal control problem considered in the first step is shifted forward in 
time by one discrete time instant, see, e.g., |fT2| for an overview of this method. 

Concerning the computing step of the control law, we consider a finite horizon 
optimal control problem, that is we minimize the truncated cost functional 

N-l 

J N (x Q , u) = 2_j IMk, Xq), u(k)). (4) 

k=0 

For reasons of clarity, we denote the closed loop solution at time instant n by 
x(n) throughout this work while x u {-, x ) denotes the open loop trajectory of the 
prediction. Moreover, we use the abbreviation 

u N (-,x ) = argmin 7 w (x , m) and u N (x ) = u N (0, x ) (5) 

U€V N 

for the minimizing open loop control sequence of the truncated cost functional 
and its first element respectively. Moreover, we denote the optimal value function 
of the finite cost functional Q by V N (x ) = min HeU « J N (x , u). 
Given the initial value x m (0, xo) = xq, the open loop control ([5]) induces the open 
loop solution 

x UN (k+ l,x ) = f(x UN (k,x ),u N (k,xo)), Vke {0,...,N- 1}. (6) 

Via the implementation and shift steps of the NMPC controller described earlier, 
we obtain a feedback control //#(•) which can be defined via Bellman's principle 
of optimality 

fi N (x(n)) = argmin {V N -i(x u (l,x(n))) + l(x(n), u)} . (7) 

well 

Using the feedback the closed loop system is given by 

x(n + 1) = f(x(n),fx N (x(n))) , x(0) = x , n e N . (8) 

In the following, we are interested in the stability and suboptimality properties of 
the closed loop solution (|7]), ((8]). Note that due to the truncation of the infinite 
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horizon cost functional, stability and optimality properties induced by the infinite 
horizon optimal control ([2]) are not preserved in general. Here, we focus on the 
NMPC implementation without additional stabilizing endpoint constraints or a 
Lyapunov function type endpoint costs and a terminal region which are outlined 
in, e.g., IfTOl and [0 respectively. 

Our aim in this work is to show that the requirement of considering the worst 
case optimization horizon N for all initial values x e X can be weakened with- 
out loosing stability of the closed loop @, ([8]). Additionally, we show that the 
resulting closed loop trajectory satisfies locally a predefined degree of subopti- 
mality compared to the infinite horizon solution ([I]), ([2]) with u(ri) = fi(x(n)). To 
this end, we compare the infinite horizon cost induced by the NMPC control law 
fJLjfi'), that is Vn(xd) := 2,?=o I (x(n),/u N (x(n))), and the finite horizon cost V N (-) or 
the infinite horizon optimal value function V«,(-)- In particular, the latter gives us 
estimates about the degree of suboptimality of the controller //#(•) of the NMPC 
process. 

Note that since we do not assume terminal constraints to be imposed, feasi- 
bility of the NMPC scheme is an issue that cannot be neglected. In particular, 
without these constraints the closed loop trajectory might run into a dead end. To 
exclude such a scenario, we assume the following viability condition to hold. We 
like to note that in case of stabilizing endpoint constraints IfTOll or a terminal region 
[|2) this assumption holds implicitely. 

Assumption 1. For each xeX there exists a control weU such that f(x, u) e X. 

In order to derive adaptation strategies for the horizon length in this setting, we 
make extensive use of the suboptimality estimates derived in [7J. Methods to eval- 
uate these estimates rely on a rather straightforward and easily proved "relaxed" 
version of the dynamic programming principle, see also Etui- For proofs of the 
following estimate see [0 Proposition 3]. 

Proposition 2. Consider a feedback law /j.n '■ X — > U and its associated trajectory 
x(-) according to ([8]) with initial value x(0) = xo el // there exists a function 
V N : X — » M.q satisfying 

V N (x(n)) > V N (x(n + 1)) + al(x(n), /%(x(n))) (9) 

for some a e [0,1] and all neNo then 

aVMn)) < aV^{x{n)) < V N (x(n)) < V^n)) (10) 

holds for all n 6 No. 
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Note that Proposition @ is an a posteriori estimate since V N (x(n + 1)) is not 
available at time n. A more conservative a priori estimate is given in [7, Theorem 
7] using the following assumptions: 

Assumption 3. For given N, N e N, N > N > 2, there exists a constant y > 
such that for the open loop solution x UN (k, x(n)) given by ([6]) the inequalities 

V$(x UN (N-N,x(n))) . 
< max l(x UN (N - j,x(n)),fij-i(x UN (N - j,x(n)))) 

7 + 1 j=2,...,N 

Vk(x lw (N - k, x(n))) 

-^-^ / < l(x UN (N - k, x(n)),fi k (x UN (N - k, x(n)))) 

y + 1 

hold for all k e {N + 1, . . . ,N} and all n e N . 

Theorem 4. Consider y > and N, N e N, N > N such that (y + 1)^ > y N ^ +2 



holds. If Assumption\3jis fulfilled for these y, N and N, then ( fTOp holds for all 
n eNo where 

(y + l)^-y^ 2 

a := : . (11) 

(y+l)^ 

Comparing the estimates from B71|9l[T8]|, we call the maximal value of a satisfying 
@ local suboptimality degree if x(n) e X is fixed. For a given closed loop trajec- 
tory x(-) we call a := maxla | ((9]) holds Vn e N } the closed loop suboptimality 
degree and for a given set X we call a := max{ar | (|9j) with x(n) = x holds Vx 6 X} 
the global suboptimality degree. 

Unfortunately, we cannot expect the relaxed Lyapunov inequality (|9]) or As- 
sumption[3]to hold in practice for the following reason: In many cases the discrete 
time system ([T]) is obtained from a discretization of a continuous time system, e.g. 
sampling with zero order hold, see lfT3l HM . Hence, even if, e.g., a continuous 
time system is stabilizable to a setpoint x* and no numerical errors occur dur- 
ing optimization and integration, the corresponding sampled-data system is most 
likely practically stabilizable at x* only. 

For this reason, the a posteriori and a priori estimates from Proposition [2] and 
Theorem [4] have been extended to cover the case of practical stability as well, 
see also [0 Proposition 14 and Theorem 20]. Since these practical suboptimality 
estimates can be used in a similar manner as in the non-practical case, we show 
stability of the horizon adaptation technique and adaptation strategies for the non- 
practical case only. Corresponding results can be found in [fT6l Chapter 3 and 4]. 
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3. Adapting the NMPC Scheme 

Since the suboptimality estimates from Proposition [2] and Theorem [4] are com- 
putable online, we may utilize them to repeatedly adapt the optimization horizon. 
To this end, we extend our notation of the horizon length from N to N n , of the 
corresponding local suboptimality degree from a to a(N n ) and of the closed loop 
control law from Hn(-) to where n indicates the discrete time instant. Defin- 

ing a fixed suboptimality bound a e (0, 1), we propose the following algorithm to 
guarantee local suboptimality degree a: 

Algorithm 5. Set n := and choose a e (0, 1) and N„ e N. 

1. Obtain new measurements x{n). 

2. Set or = 0. While a < a do 

(a) Compute the open loop optimal control sequence w#(s x(n)) from d5]) 

(b) Compute suboptimality degree a := a(N„) from, e.g., Proposition [2jor 
Theorem |4] 

(c) If a > a: Call shortening strategy for N n 
Else: Call prolongation strategy for N n 

3. Implement n (Nn) (x(n)) := u(0, x(n)), set n : = n + 1 and goto Step 1. 

The problem which we are facing for such an adaptive MPC algorithm is the 
fact that none of the existing stability proofs, see, e.g., fl6l|71[8]|, can be applied 
in this context since these results assume Af to be constant while here the opti- 
mization horizon iV,, may change in every step of the MPC algorithm. The major 
obstacle to apply the idea of Proposition [2] in the context of varying optimization 
horizons N n is the lack of a common Lyapunov function along the closed loop. To 
compensate for this deficiency, we assume that if for a horizon length A^„ we have 
a(N n ) > a, then the controller shows a bounded guaranteed performance if N n is 
increased. For ease of notation, we give this assumption in a set valued manner, 
however, within the following stability proof it is only required to hold along the 
closed loop. 

Assumption 6. Given an initial value x e X and a horizon length N < oo such 
that /zjv(') guarantees local suboptimality degree a(N) >a,ae (0, 1), we assume 
that for N > N, N < oo, there exist constants C/, C a > such that the inequalities 

Vfi(x) - V]j(f(x,iJ. N (x)) 

l(x,n N (x)) < Cil(x,Hjj(x))—- — T7-— — (12) 

v i/( x )- Vff(f(x,fitf(x)) 

a(N) < ^a(N) (13) 
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hold where a(N) is the local suboptimality degree of the controller yu^(-) corre- 
sponding to the horizon length N. 

Note that Assumption [6] is indeed not very restrictive since we allow for non- 
monotone developments of the suboptimality degree «(•) if the horizon length is 
increased which may occur as shown in [3J. Moreover, we only make sure that 
if a certain suboptimality degree a e (0, 1) holds for a horizon length N, then the 
estimate a(N) does not drop below zero if the horizon length N is increased. 

Using Assumption [6] to hold along the closed loop, we obtain stability and a 
performance estimate of the closed loop for changing horizon lengths: 

Theorem 7. Consider a e (0, 1) and a sequence (N n ) ne ?q , Ni e N, where N* = 
max{Af„ | n e No), such that the NMPC feedback law fi(N„) defining the closed loop 
solution ([8]) guarantees 

V Ntt (x(n)) > V Nn (x(n + 1)) + al(x(n), H Nn (x( n ))) ( 14 ) 

for all n e N . Moreover suppose Assumption^to hold for all pairs (x(n),N n ), 
n e No- Then we obtain 

acVMn)) < a c VT"\x{n)) < V N +(x(n)) < KoOOO) (15) 

CJJ)- { 

7'eNoJ>« C,{ J ) 



to hold for all neNj with etc '■= mm 7^rl a an d C a (j), Ci(j)from ([12]), ([13]) for 



x = x(j), j>ne N . 

Proof. Given a pair (x(n),N n ), Assumption [6] guarantees a(N n ) < a(A^)/C„ ) for 
N > N n . Now we choose N = N* within this local suboptimality estimate. 
Hence, we obtain a < a(N n ) < a(N i ')/C a (n) using the relaxed Lyapunov in- 



equality ( fT4] ). Multiplying by l{x{n),p Nn {x{n))) and using ( [12] ), we can conclude 



al(x(n),u N (x(ri))) < ^\ (V N *(x(n)) - V N *(x(n + 1))). Since the latter condition 
relates the closed loop varying optimization horizon to a fixed one, it allows 
us to use an identical telescope sum argument as in the proof of [7, Proposi- 
tion 3]. Hence, summing the running costs along the closed loop trajectory re- 
veals a c Tj% n l(xU)'l u Nj(xU))) ^ VV (*(«)) - V N *(x(K + 1)) where we defined 



a c '■= min j?j4a with constants C a (j) and C/(j) from ([T2~|) and ([T3]) for x = x(j) 



je[n,...,K] Cl( j> 



and je {«,..., K}. Since Vn*(x(K + 1)) > holds, taking K to infinity reveals 

K 

a c VT n \x(n)) = a c lim V \l(x(j),p Nj (x(j))) < V N *(x(n)). 



,/=" 
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Since the first and the last inequality of ( [T5] ) hold by the principle of optimality, 
the assertion follows. □ 



Similar to Proposition [2j Theorem [7J can be extended to the practical case, 
cf. [16, Chapter 4]. We like to point out that Theorem [7J is a generalization of 
Proposition [2] which is reobtained if N„ = N for all n e N . 

Note that the closed loop estimate ac in ([15]) may be smaller than the local 
suboptimality bound a. In particular, since l(x,ju^(x)) may tend to zero if N is 
increased, we obtain that C/ in ( fT2| ) is in general unbounded. The special case 
l(x,Hfj{x)) = 0, however, states that the equilibrium of our problem has been 
reached and can be neglected in this context, i.e. outside the equilibrium or c > 
is always retained. Yet, ac may become very small depending on C a and C/ from 
Assumption[6j During our numerical experiments, however, no such case occured, 
see also Section [5] 

4. Adaptation Strategies 

As we have seen in Theorem [7J the methods from Proposition [2] and Theorem 
[4] can be applied to compute a suboptimality estimate for a given pair (x(ri),N n ). 
Yet, these local estimates have to reinterpreted along the closed loop. In particular, 
we require the existence of a finite horizon length N n guaranteeing stability with 
suboptimality degree greater than a in order to conclude finite termination of the 
algorithm proposed in Section [3j 

Assumption 8. Given a e (0, 1), for all x e X there exists a finite horizon length 
./V = N(xq) € N such that the relaxed Lyapunov inequality ([9]) holds with a(N) > a 
for all horizon lengths N > N. 

Note that Assumption's satisfied if a is small enough and N is large enough, 
see, e.g., J6j. 

4.1. Simple Adaptation Strategies 

A basic adaptation technique for the horizon length can be obtained using 
Proposition |2j In particular, we can repeatedly shorten the horizon length and 
check Assumption ([9]) as the solution evolves: 

Theorem 9. Consider an optimal control problem ([5]), ([6]) with initial value xq = 
x(n), N n 6 N and a £ (0, 1) to be fixed and denote the optimal control sequence 
by u*. Suppose there exists an integer k e No, < k < N n such that 

V Nn . k (x UN (k, xq)) > V Nn - k (x u *(k + l,x )) + al(x u *(k, x ),fi Nn . k (x u *(k, x ))) (16) 
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holds true for allQ < k < k. Then, setting N n+k = N n -k and fx Nn+k (x(n+k)) = u*(k) 
for < k < k, inequality (_14) holds for k = n, . . . ,n + k with a = a. 

Proof. The assertions follows directly from the fact that for fx Nn+k (x(n+k)) = u*(k) 



the closed loop satisfies x(n+k) = x u *(k, x{n)). Hence, ( fT4| ) follows from ( |16| ). □ 



Note that the result of Theorem[9]can be extended to consider an m-step feed- 
back as defined in (8]| by supposing k > m - 1. With the choice N n+k = N n — k, due 
to the principle of optimality we obtain that the optimal control problems within 
the next k NMPC iterations are already solved since fi Nn - k (x(n+k)) can be obtained 
from the optimal control sequence u*{-) computed at time n. This implies that the 
most efficient way for the reducing strategy is not to reduce N n itself but rather to 
reduce the horizons N n +k by k for the subsequent sampling instants n + 1 , . . . , n + k, 
i.e., we choose the initial guess of the horizon N„+\ = N n - 1. Still, if the a poste- 
riori estimate is used, the evaluation of < |T6] ) requires the solution of an additional 
optimal control problem in each step. 

In order to to use the a priori estimate given by Theorem[4]the following result 
can be used as a shortening strategy: 

Theorem 10. Consider an optimal control problem ([5]), ([6]) with initial value xq = 
x{n) and N n , N e N, N n > N > 2 and denote the optimal control sequence by u*. 
Moreover, a e (0, 1) is supposed to be fixed inducing some y(-) via ( fTTj ). If there 
exists an integer k e No, 0<k<N n -N- \ such that for allO < k < k there exist 
7n > 0, y n < y(N„ - k) satisfying 

Vff(x u *(N n - N, x Q )) < (y n + 1) max l(x u *(N n - j, xo),fij-i(x u *(N n - j, x ))) 

j=2,...,N 

(17) 

Vj k (x u *(N„ - j k , Xq)) < {j n + 1)1(X U *(N» - j k , X ), /J, jk (x u *(N n - j k , X ))) (18) 

for all jk e {N + 1, . . . , N n - k\. Then, setting N n+k = N„ — k and /j. Nn+k (x(n + k)) = 
u*(k)for < k < k, inequality { fT4] ) holds for k = n, . . . ,n + k with a = a. 



Proof. Since ([T7J, ((TgJ hold for k = with y„ > and y n < y(N n ), Theorem |4] 



guarantees that the local suboptimality degree is at least as large as a. If k > 
holds, we can make use of the fact that for p Nn+k (x(n + k)) = u*(k) the closed loop 
satisfies x(n + k) = x u *(k,x(n)). By ( [T7] ), ( [T8] ), we obtain Assumption [6] to hold 
along the closed loop. Accordingly, the assertion follows from Theorem [4] which 
concludes the proof. □ 
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Similar the a posteriori case, the shortening strategy given by Theorem 10 
can be extended to consider an m-step feedback as defined injp by supposing 
k > m - 1. Note that while the a priori estimate from Theorem |4j is slightly more 
conservative than the result from Proposition |2l it is also computationally less 
demanding if the value N is small. 



The shortening strategies induced by Theorems [9] and 10 can be extended to 



cover the case of practical stability. To this end, the inequalities ( p~6| ), (FT) and 
( fT8| ) have to be replaced by their practical equivalents given in [[Vj Proposition 14] 
and [7, Theorem 21]. 

In contrast to these efficient and simple shortening strategies it is quite difficult 
to obtain efficient methods for prolongating the optimization horizon TV,,. In order 
to provide a simple prolongating strategy, we invert the approach of Theorem [9} 

Theorem 11. Consider an optimal control problem ([5]), ([6]) with initial value 
x = x(n) and N„ e N. Moreover, for fixed a e (0,1) supposed Assumption^ 
to hold. Then, any algorithm which iteratively increases the optimization horizon 



N n terminates infinite time and computes a horizon length N n such that ( |T4| ) holds 
with local suboptimality degree a. 

Proof. Follows directly from Assumption [8] □ 



Note that the prolongation strategy described in Theorem 11 only requires 
Assumption [8] to hold. This allows us to use any of the suboptimality estimates 
stated in Section [2] Unfortunately, if ( fT4"] ) does not hold, it is in general difficult to 



assess by how much N n should be increased such that ( fT4[ ) holds for the increased 
N n . The most simple strategy of increasing N n by one in each iteration shows 
satisfactory results in practice, however, when starting the iteration with N n , in the 
worst case ( fT4"] ) has to be checked N - N n + 1 times at each sampling instant. In 
contrast to the shortening strategy, the principle of optimality cannot be used here 
to establish a relation between the optimal control problems for different N n and, 
moreover, these problems may exhibit different solution structures which makes 
it a hard task to provide a suitable initial guess for the optimization algorithm. 

4.2. Advanced Adaptation Strategies 

Since the shortening strategies based on both the a posteriori and the a priori 
estimates can be implemented with negligible additional computational effort, we 
focus on the prolongation of the horizon. To reduce the additional effort, we 
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analyze the relationsship of a(N n ) and y(N n ) given by Theorem |4} If we consider 
a € (0, 1), we obtain a lower bound for N n from (jTTJ) by 



N„ > 



\ 2\n(y(N n ))-\n(\-a) 
+ ln(y(N n ) + l)-ln(y(N n )) 



=: ®(N n ) (19) 



for fixed x(n), N and a. Since we want to guarantee local suboptimality degree 
a and N n to be as small as possible, we seek a horizon length N n satisfying N n = 
®(N n ), i.e. a fixed point of the function $(•)• 

Theorem 12. Consider N n ,N e N, N„ > N > 2, and a e (0,1) to be fixed and 
y(N n ) to minimally satisfy Assumption [5] If for a given n e No there exists a 



constant 6 e [0, 1) such that the function O(-) defined in (19) satisfies 



m®(N n )) - 0(AW| < 6\®(N n ) - N n \ 9 e [0, 1) ViV„ > N. (20) 

and <£> k (N n ) > N for all k e N, then there exists N* e N with N* = <&(N*) and 
0> k (N„) -» N*, k -> oo. If we use N„ = N* in Algorithm \5\for all n e N and 
Assumption^holds for x = x(n) and all n e No, then the closed loop solution ([8]) 
is asymptotically stable and exhibits local suboptimality degree a(N*) > a. 

Proof. Since y(N n ) satisfies all requirements of Theorem |4} we can obtain an es- 
timate a(N n ) via ( fTT| ). In order to guarantee a certain degree of suboptimality a 

(•y+iyVn-iV /V,,-iV+2 

we have to show a < a(N n ) = — — | 1)JV „_ A • This can be solved for N„ giving 



N„ > ®(N„) with O(-) from ( 19 ). Due to ffl we have 



!<&*(#„) - &-\N n )\ < tf-'mNn) ~ N n \. (21) 



Since 9 e [0, 1) the right hand side of ( |2T| ) tends to zero. Hence, there exists an 
index k eN such that 8 k ~ l \(E>(N n )-N n \ < 1. Defining the sequence of optimization 
horizons via (N^) ieNo := (<E»'(A^„)) ieNo we obtain Ni J) = > N for all j,k > k. 

Hence, (M°), eNo is converging and N* = <X>(iV*) holds for N* = N„\ 
Choosing N n = N*, the local suboptimality degree satisfies a(N„) > a by con- 
struction of O(-). Hence, a new initial value can be obtained by implementing 
the controller in a receding horizon fashion. Since we can apply this procedure 
along the resulting trajectory, i.e. for all n e No, asymptotic stability of the closed 
loop solution ([8]) follows by Assumption [6] and Theorem[7]and a(N*) > a follows 
directly from Theorem |4} □ 
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Note that, in general, we cannot a priori check whether O(-) satisfies (|2T)]) 



Moreover, an algorithm derived from Theorem 12 may cause overshoots. Numer- 
ical experience has shown that <x = 5 is a suitable choice to bound the change 
of the the horizon length, yet, this variable should be chosen with respect to the 
considered problem. Additionally, numerical simulations indicate that the "best" 
choice of a depends on the occuring horizon lengths N„, i.e. larger horizons allow 
for larger choices of <x. 



Theorem [T2j can also be utilized to shorten the horizon. However, the com- 
putation of N* requires nonnegliable effort. Hence, this strategy should only be 
considered when a(N) < a. Yet, Ni+\ := 0(iV,-) may be a suitable choice for the 
optimization horizon in the subsequent optimal control problem. 

Different from the fixed point idea, a map can be designed which gener- 
ates a sequence of horizons (N„ ) via A^ +1) := ^A^) such that the suboptimality 
estimate a(N^) is monotonely increasing: 

Lemma 13. Suppose N n ,N e N, N„ > N > 2, a e (0, 1) and < 6 < 1 - a(N n ) 
are given and Assumption [5] holds. Suppose there exists a constant & > such 
thaty(N) < fty(N n ) holds for 



N> 



N + 



In 



y(N, 



y(N, 



N„-N 



y(N„) 2 J 



21n(#) 



\n($y{N n ))-\n(Vy{N n ) + \) 



=: TO), 



(22) 



then a(N) as defined in ( |TTj ) with y = y(N) from Assumption\3jsatisfies 

a(N) > a(N n ) + 6 



(23) 



Proof. In order to show a(N) > a(N) + 6 we use ( |TTj ) in ( [23] ) which gives us 

(/■Cn \N-N I \N-N 

? N) <y(Nf 7(N) -6. 

Overestimating the left hand side using y(N) < &y(N) this leaves us to show 

,N~N / ,-kts \N-N 



A ; \#y(A0 + i 



< y(A0 2 



y(AQ 
r(A0 + 1 



to guarantee ((23]). Since # > 0, this inequality is equivalent to 



(n-n) 



In 



$y(N) + 1 



<ln 



y(AQ 

y(A0 + 1 



N-N 



y(N) 2 



-6 



-21n(#) 



Using negative definiteness of In ( J^^ ) and ( [22] ) the assertion follows. □ 
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Similar to O(-) from Theorem [12] the map ¥((•) may be used to shorten the 



horizon, a feature which can be avoided easily: 



Lemma 14. If§ >\andd>0 hold, then we have *P(N) > N for ^{-)from fl22p . 
Proof. In the special case 6 = 0, ( f22| ) simplifies to 

(N - N) (In (y(JV) + 1) - In (y(iV))) + 2 In (#) 



T^AO : = 



ln(#y(A0 + l)-ln(#y(A0) 



Since '" (A " + ,^"l n ^ » > 1 and & > 1 hold, we obtain ^(N) > N. Moreover, 
increasing 8 results in an enlarged value ^(N) showing the assertion. 



□ 



Obtaining a suitable approximation of ft is the most crucial part of the mono- 
tone prolongation method. In order to avoid computing the open loop optimal 
control ([5]) for all initial values xeX and all N > and deriving the correspond- 
ing values y(-) to obtain ft, one can iteratively update the value of § by setting 
§ := max y(M' +1) )/'y(M' ) )}. This method is not only computationally cheap 
and gives us a lower bound for •&, it also moderates a possible overshoot. Note 
that this approximation has to be restarted for each n e N . 



Next, we use Lemma 13 to show that a prolongation strategy based on *F(-) in 
Step 2 of Algorithm [5] terminates in finite time: 

Theorem 15. Suppose Assumptions [5] and |5| hold an d suppose ■& > 1. Then a 
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finite number of iterations N% + := ^(N^) with ¥(■) according to Lemma 
are required to obtain a horizon length N* which guarantees local suboptimality 
degree a(N*) > a. 

Proof. Using the stopping criterion a(N n ) > a of Algorithm [5] to define 8 = a - 
a(N n ), we always have 6 > 0. Hence, by Lemma [14] we can conclude that the 
horizon length is increasing in every step of the iteration due to & > 1 . Since 
Assumption [^guarantees the existence of a finite horizon length N e N, N < co, 
such that a{N n ) > a holds for all N„ > N, the iteration A# +1) := ^(N^) terminates 
in finite time. Hence, choosing N n = N* we have that a(N„) > a is guaranteed by 
the stopping criterion. □ 



Note that we do not assume y(-) in ( fT9| ) to be computed in a specific way 
but only to satisfy Assumption [3] Hence, using Theorems [T2] and 15 a suitable 
horizon N n can be obtained such that a(N n ) > a holds even if some y(-) > y(-) are 
used. Therefore, also the a priori practical estimate Theorem 21] or the global 
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estimate presented in [9, Theorem 4.5 and 5.8] can be applied. Additionally, ( [TTj ) 
defines a bijective mapping relating a(N n ) and y(N n ) which allows us to use the a 
posteriori estimates of Proposition [2] and [7, Proposition 14]: 

Lemma 16. Given N n ,N e N, N„,N > 2, the mapping F : [0, oo) — > (-oo, 1], 
r (*) ■= 1 - (x+iy v„-* « bijective. 

Proof. In order to show bijectivity, we use continuity of T(-) on [0, oo) and T(0) = 
1 and lim AWCO Y(x) = -oo to show surjectivity. In order to obtain injectivity, we 
show that T(-) is strictly monotone on [0, oo), i.e. for all x e [0, oo) and e > we 
have T(x) > Y(x + s). Using the definition of T(-) in this last inequality, we obtain 
(1 + s/(x + \)) N "-" < (1 + s/x) N "~" +2 which holds true for x> since N n -N > 0. 
Last, considering x = 0, we see that T(s) < 1 for all e > and hence the assertion 
follows. □ 



Now, Lemma [T6] allow s us to solve (TTTl) for y(N n ). Since T(-) is twice continu 



ously differentiable on [0, oo), this can be done effectively using Newton's method 

r «*'W,) :=y»(JV.) + ; , (24) 

(^'(Al-^i+W.)) 

where y (0 \N n ) := 1 can be used as the initial value due to strict monotonicity of 
T(-). However, since we do not expect y(N n ) to vary massively along the closed 
loop, the information from previous steps may be reused. 

Using the a posteriori estimate, we obtain an additional degree of freedom 
since N can be chosen arbitrarily within the set {2, . . . ,N n }. However, it is not 
clear which value is the best: For one, the smallest possible y(N n ) > is obtained 



by setting N := 2. Formulas { fT9| ) and ( |22l ), however, logarithmically depend on 



y(N n ), i.e small values of y(N n ) might lead to overshoots. 



5. Numerical Results 

To illustrate the effectiveness of our adaptation strategies, we consider a digital 
redesign problem (cf. lfT5"l ) of an arm/rotor/platform (ARP) model stated in 01 : 

xi(J) = x 2 (f) + x 6 (t)x 3 (t) 

x 2 (t) = -kiXi(f)/M - b\x 2 (t)IM + x 6 (t)x 4 (t) - mrbix 6 (t)/M 2 
x 3 (t) = -x 6 (t)xi(t) + x 4 (t) 
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x 4 (f) = -x 6 (t)x 2 (t) - hx 3 (t)/M - biX 4 (t)/M + mrki/M 2 
x 5 (t) = x 6 (t) 

x 6 (t) = -aix 5 (t) - a 2 x 6 (t) + aix 7 (t) + a 3 x s (t) - p\X\{t) - p 2 x 2 (t) 
x 7 (t) = x 8 (t) 

i 8 (0 = a 4 x 5 {t) + a 5 X(,{t) - a 4 x-j{i) - (a 5 + a^x^t) + u(t)/J 

For the digital redesign a continuous time full-state feedback uq was developed 
via backstepping such that the output g(t) := xs(t) - a{ "l lCIJ [xe(t) - a^if)] is close 
to x 5 (t) and tracks a given reference signal f re f(-), see ^ Chapter 7.3.2] for details 
on the backstepping design and the specification of the model parameters. The 
resulting continuous time solution of X5 ire f(-) is then used as a reference to compute 
a sampled-data control with zero order hold, cf. [TT5l . To solve the sequence of 
optimal control problems we use a direct approach and employ an SQP method to 
solve the resulting optimization problem. Here, we consider the initial value x = 

(0, 0, 0, 0, 10, 0, 0, 0) T , the cost functional J N (x , u) = Zto f + ' l*s(0 - x 5M (t)\dt 

jtj 

where the sampling instances are equidistantly fixed via t\ = t + iT with T = 0.2, 
set tolerance levels of both the minimizer and the differential equation solver to 
10~ 6 and use reference function 




10, te [0,5) U [9, 10) 
0, te [5,9) U [10, 15) 



cf. Figure [TJ a). For this problem it is known from Section 5] that the subop- 




9 10 



15 



t 



(a) Tracking function f re f(0 (solid) and re- 
design reference X5 >re f(0 (dashed) 



5 9 10 15 

(b) Horizons for adaptive NMPC with a pos- 
teriori estimate (solid) and adaptive NMPC 
with a priori estimate (dashed) 



Figure 1 : Tracking function and horizons of the adaptation scheme 



timality estimates of Proposition [2] and Theorem [4] show only poor performance. 
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For this reason we employ the practical variants [7, Proposition 14 and Theorem 
21] with e = 10~ 5 and set the lower bound a = 0.5. 

Using the a posteriori and a priori estimation techniques within Algorithm |5j we 
obtain the evolutions of horizons N„ along the closed loop for the suboptimal- 
ity bound a = 0.5 as displayed in Figure [jjb). In particular, one observes that 
the less conservative a posteriori algorithms yields smaller optimization horizons 
which makes the resulting scheme computationally more efficient, see also Table 
[T] However, the evaluation of the a posteriori criterion itself is computationally 
more demanding, see also Figure [2| below. 



Adaptive NMPC 


Time in [10 




Horizon length 


Implementation 


Estimate 


max 


min 





max 


min 





Standard NMPC 




86.50 


3.23 


23.14 


6 


6 


6 


Theorem 


11 




a posteriori 


183.19 


1.30 


13.37 


6 


2 


2.39 


Theorem 


12 




a posteriori 


184.79 


1.49 


11.55 


6 


2 


2.25 


Theorem 


15 




a posteriori 


183.16 


1.48 


16.28 


6 


2 


2.43 


Theorem 


11 




a priori 


226.10 


2.12 


17.46 


6 


3 


3.21 


Theorem 


12 




a priori 


219.32 


1.45 


18.96 


6 


3 


3.19 


Theorem 


15 




a priori 


219.44 


1.47 


15.60 


6 


3 


3.13 



Table 1 : Comparison of NMPC results in the tracking type example 



It is also interesting to compare these horizons to the standard NMPC Algo- 
rithm with fixed N which needs a horizon of N = 6 in order to guarantee a > a 
along the closed loop. Here, one observes that the required horizon N n for the 
adaptive NMPC approach is typically smaller than N = 6 for both the a posteriori 
and the a priori estimate based variant. From Figure |TJb) one also observes that 
the horizon is increased at the jump points of the reference function (•), which 
is the behavior one would expect in a "critical" situation and nicely reflects the 
ability of the adaptive horizon algorithm to adapt to the new situation. 
Although the algorithm chooses to modify the horizon length throughout the run 
of the closed loop, one can barely see a difference between the resulting x 5 (-) 
trajectories and the (dashed) reference trajectory given in Figure [jja)- For this 
reason, we do not display the closed loop solutions. Instead, we additionally plot- 
ted the computing times of the two adaptive NMPC variants in Figure [2j Again, 
one can immediately see the spikes in the graph right at the points in which 
jumps. This figure also illustrates the disadvantage of the algorithm of having 
to solve multiple additional optimal control problems whenever N„ is increased, 
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which clearly shows up in the higher computation times at these points, in partic- 
ular for the computationally more expensive a posteriori estimate. 




Figure 2: Computing times of the standard NMPC (solid), adaptive NMPC with a 
posteriori estimate (dotted) and adaptive NMPC with a priori estimate (dashed) 



Last, we can use Theorem [7J to obtain the closed loop suboptimality degree 
a c of the different implementations shown in Table [2] Note that we can restrict 
ourselves to those time instances where /(•, •) > e holds and that according to 
Theorem [7j ac is given by combinations of Ci(n) and C a {n) while Table [2] shows 
minimal and maximal values of C/ and C a . 



Adaptive NMPC 


Q 


C a 


ac 


Implementation 


Estimate 


min 


max 


min 


max 




Theorem 


11 




a posteriori 


0.9959 


1.1282 


1.0000 


1.0506 


0.4431 


Theorem 


12 




a posteriori 


1.1078 


1.1078 


1.0000 


1.0000 


0.4513 


Theorem 


15 




a posteriori 


0.9638 


1.1760 


1.0000 


1.1088 


0.4291 


Theorem 


11 




a priori 


0.8824 


1.1460 


1.0010 


1.8321 


0.4751 


Theorem 


12 




a priori 


0.9480 


1.5334 


0.9816 


1.6457 


0.3366 


Theorem 


15 




a priori 


0.8835 


2.0362 


0.9856 


1.8360 


0.2535 



Table 2: Values of C/, C a and a c of Theorem [7] 



From Table [2] we obtain that ac may deteriorate if the a priori estimate is used 
while results based on the a posteriori estimate show that a c is close to the min- 
imal local suboptimality degree a = 0.5. In either case, the presented adaptation 
strategies guarantee stability of the closed loop and show a satisfactoring local and 
closed loop suboptimality degree. 
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6. Conclusion 



We derived various adaptation strategies for the horizon length of an NMPC 
controller and showed stability and suboptimality of the resulting closed loop tra- 
jectory. Moreover, we have shown the practicability and effectiveness of these 
methods. Future work concerns many parts of this method. Probably the most im- 
portant point is to improve the a priori estimates from Theorem[4]and [7 , Theorem 
20] by a more detailed analysis of the parameter N and to develop other efficiently 
computable suboptimality estimates. Moreover, different feasibility conditions as 
well as development and investigation of alternatives to prolongate or shorten the 
optimization horizon will be an issue. In particular, combinations of iterates may 
allow for further insight of the process under control. 

This work was supported by DFG Grant Grl569/12 within the Priority Research 
Program 1305 and the Leopoldina Fellowship Programme LPDS 2009-36. 
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